Setup

The alignments in this analysis were generated by aligning each library (including technical replicates) to the Zebrafish transcriptome from Ensembl Release 94 (GRCz11) using kallisto (v0.43.1). In addition to the standard transcriptome, the two mutant psen2 transcripts were manually added to the reference.

The corresponding set of gene descriptions were then loaded into R as an EnsDb object using the AnnotationHub() infrastructure. Likewise, the set of transcript descriptions were loaded, with the manual addition of the two novel psen2 mutants.

Gene-level Counts

Gene-level counts were imported using tximport, mapping transcripts to genes.

Genes were retained for analysis if a CPM > 1 was observed for \(\geq\) 5 samples. This equated to about 31 reads for a gene in at least 5 samples for inclusion in downstream analysis, giving a total of 20,743 of the original genes for DGE analysis.

*Total counts from each library after assigning to genes*

Total counts from each library after assigning to genes

Counts were also processed using the voom transformation using quality weights to allow for analysis using normal-based algorithms. Sample weights ranged between 0.436 and 1.37, with the most strongly down-weighted being a WT sample.

Transcript-level Counts

Transcript-level counts were imported using catchKallisto() from edgeR in order to utilise the voom transformation on transcript-level counts.

*Sample weights using transcript-level counts, showing near identical patterns to those observed at the gene-level.*

Sample weights using transcript-level counts, showing near identical patterns to those observed at the gene-level.

Genotype checks

*CPM values for each psen2 transcript across all samples.*

CPM values for each psen2 transcript across all samples.

Transcript abundances (using CPM) were calculated for each of the three psen2 transcripts, and showed expected patterns of heterozygous expression for FAD samples and all WT expression for the WT samples. However for sample 8_FS_4, no WT allele was detected which is quite inexplicable, and this sample should be excluded from all analyses. The remaining FS samples showed reduced abundance of the FS transcript, as expected under NMD. No increases in expression of the WT allele were evident, supporting a lack of genetic compensation.

This sample was then removed from all objects, along with 12_WT_4 which had been consistently down-weighted.

Data Inspection

The next step was to perform an MDS analysis. However, minimal separation was observed between sample groups, A simple PCA also revealed that the first few principal components capture less of the total variability than might be expected,

MDS plot showing no clear groups within the data. Point sizes indicate sample weights as calculated by voomWithQualityWeights().

First five principal components, showing that the first two only account for 28.7% of the total variance, which is below expectations
  PC1 PC2 PC3 PC4 PC5
Standard deviation 28.94 25.91 24.47 22.66 21.04
Proportion of Variance 0.1593 0.1277 0.1139 0.0977 0.08418
Cumulative Proportion 0.1593 0.287 0.4009 0.4986 0.5828

DGE Analysis

Design

Three comparisons were defined with the first two being the difference between the two mutant families and the wild-type samples. The third comparison was defined as being between the two mutant groups.

FS Vs WT

The first analysis was comparing psen2N140fs/+ samples to psen2+/+ samples. A total of 5 genes were potentially detected as differentially expressed using an FDR of 5%. In the following plots, a negative value for logFC corresponds to decreased expression in the heterozygous mutants.

*MD plot for psen2^N140fs/+^ samples compared to psen2^+/+^ samples*

MD plot for psen2N140fs/+ samples compared to psen2+/+ samples

*Volcano plot for psen2^N140fs/+^ samples compared to psen2^+/+^ samples*

Volcano plot for psen2N140fs/+ samples compared to psen2+/+ samples

10 most highly ranked genes in the comparison between psen2N140fs/+ samples and psen2+/+ samples
ID Symbol logFC AveExpr P.Value FDR Brief Description
ENSDARG00000015540 psen2 -0.6217 4.581 1.587e-07 0.002113 presenilin 2
ENSDARG00000116774 CABZ01035279.1 -9.702 0.2333 2.037e-07 0.002113 PEST proteolytic signal-containing…
ENSDARG00000115710 si:ch211-160d14.6 -8.488 -0.06027 2.22e-06 0.01535 si:ch211-160d14.6
ENSDARG00000086977 atxn1l -0.7352 2.468 7.827e-06 0.04059 ataxin 1-like
ENSDARG00000076176 ptcd1 0.8676 2.504 1.032e-05 0.04281 pentatricopeptide repeat domain…
ENSDARG00000115219 CU179663.1 -0.9117 3.701 2.782e-05 0.09616 ATPase phospholipid transporting…
ENSDARG00000112605 BX649405.1 -1.044 2.183 5.868e-05 0.1739 pentatricopeptide repeat domain…
ENSDARG00000089477 si:ch211-132g1.3 -0.4391 5.539 0.0001235 0.2866 si:ch211-132g1.3
ENSDARG00000021265 mybpc2b 5.46 1.16 0.0001322 0.2866 myosin binding protein…
ENSDARG00000004597 lrrc4ba -0.4269 4.793 0.0001381 0.2866 leucine rich repeat…
*Expression patterns for significantly DE genes in the comparison between psen2^N140fs/+^ samples and psen2^+/+^ samples*

Expression patterns for significantly DE genes in the comparison between psen2N140fs/+ samples and psen2+/+ samples

*Expression patterns for the next most highly ranked genes in the comparison between psen2^N140fs/+^ samples and psen2^+/+^ samples, but which are not formally considered as DE*

Expression patterns for the next most highly ranked genes in the comparison between psen2N140fs/+ samples and psen2+/+ samples, but which are not formally considered as DE

FAD Vs WT

The next analysis was comparing psen2T141_L142delinsMISLISV/+ samples to psen2+/+ samples. No genes could be considered as DE using an FDR anywhere up to 50%. In the following plots, a negative value for logFC corresponds to decreased expression in the heterozygous mutants.

*MD plot for psen2^T141_L142delinsMISLISV/+^ samples compared to psen2^+/+^ samples*

MD plot for psen2T141_L142delinsMISLISV/+ samples compared to psen2+/+ samples

*Volcano plot for psen2^T141_L142delinsMISLISV/+^ samples compared to psen2^+/+^ samples*

Volcano plot for psen2T141_L142delinsMISLISV/+ samples compared to psen2+/+ samples

10 most highly ranked genes in the comparison between psen2T141_L142delinsMISLISV/+ samples and psen2+/+ samples
ID Symbol logFC AveExpr P.Value FDR Brief Description
ENSDARG00000090646 tnk2a 0.2703 5.582 2.58e-05 0.5043 tyrosine kinase, non-receptor,…
ENSDARG00000093677 si:ch211-56a11.2 1.043 1.221 7.147e-05 0.5043 si:ch211-56a11.2
ENSDARG00000103829 si:ch73-236c18.2 1.188 2.075 0.000117 0.5043 si:ch73-236c18.2
ENSDARG00000109549 BX548026.1 -0.7812 0.9629 0.0001418 0.5043 NULL
ENSDARG00000062948 wasf3b -0.4392 5.684 0.0001617 0.5043 WAS protein family,…
ENSDARG00000116688 hs6st1b 7.641 -1.831 0.0001677 0.5043 heparan sulfate 6-O-sulfotransferase…
ENSDARG00000086977 atxn1l -0.5024 2.468 0.0001702 0.5043 ataxin 1-like
ENSDARG00000055797 cnpy4 0.9893 0.2136 0.0002455 0.6364 canopy FGF signaling…
ENSDARG00000094246 si:dkey-222h21.10 1.025 1.939 0.0003698 0.6721 si:dkey-222h21.10
ENSDARG00000112102 nrros -1.027 0.03725 0.000468 0.6721 negative regulator of…
*Expression patterns for the 5 most highly ranked genes in the comparison between psen2^T141_L142delinsMISLISV/+^ samples and psen2^+/+^ samples. None were considered as DE*

Expression patterns for the 5 most highly ranked genes in the comparison between psen2T141_L142delinsMISLISV/+ samples and psen2+/+ samples. None were considered as DE

FAD Vs FS

The final analysis was comparing psen2T141_L142delinsMISLISV/+ samples to psen2N140fs/+ samples. A total of 3 genes were potentially detected as differentially expressed using an FDR of 5%. In the following plots, a negative value for logFC corresponds to decreased expression in psen2T141_L142delinsMISLISV/+ samples, whilst a positive value for logFC corresponds to increased expression in psen2T141_L142delinsMISLISV/+ samples.

*MD plot for psen2^T141_L142delinsMISLISV/+^ samples compared to psen2^N140fs/+^ samples*

MD plot for psen2T141_L142delinsMISLISV/+ samples compared to psen2N140fs/+ samples

*Volcano plot for psen2^T141_L142delinsMISLISV/+^ samples compared to psen2^N140fs/+^ samples*

Volcano plot for psen2T141_L142delinsMISLISV/+ samples compared to psen2N140fs/+ samples

10 most highly ranked genes in the comparison between psen2T141_L142delinsMISLISV/+ samples and psen2N140fs/+ samples
ID Symbol logFC AveExpr P.Value FDR Brief Description
ENSDARG00000015540 psen2 0.7082 4.581 2.248e-08 0.0004663 presenilin 2
ENSDARG00000116774 CABZ01035279.1 8.739 0.2333 4.248e-07 0.004406 PEST proteolytic signal-containing…
ENSDARG00000115710 si:ch211-160d14.6 7.687 -0.06027 4.205e-06 0.02907 si:ch211-160d14.6
ENSDARG00000115219 CU179663.1 0.9426 3.701 1.346e-05 0.06977 ATPase phospholipid transporting…
ENSDARG00000078246 si:ch211-114l13.4 1.649 2.09 1.781e-05 0.07072 si:ch211-114l13.4
ENSDARG00000103829 si:ch73-236c18.2 1.458 2.075 2.046e-05 0.07072 si:ch73-236c18.2
ENSDARG00000094346 si:ch211-114l13.3 2.013 -0.1731 3.164e-05 0.09376 si:ch211-114l13.3
ENSDARG00000094297 si:dkey-222h21.2 2.044 2.341 4.871e-05 0.1263 si:dkey-222h21.2
ENSDARG00000112605 BX649405.1 0.9878 2.183 6.796e-05 0.1566 pentatricopeptide repeat domain…
ENSDARG00000090646 tnk2a 0.2427 5.582 8.654e-05 0.1795 tyrosine kinase, non-receptor,…
*Expression patterns for significantly DE genes in the comparison between psen2^T141_L142delinsMISLISV/+^ samples and psen2^N140fs/+^ samples. This is essentially a subset of the previously identified genes*

Expression patterns for significantly DE genes in the comparison between psen2T141_L142delinsMISLISV/+ samples and psen2N140fs/+ samples. This is essentially a subset of the previously identified genes

*Expression patterns for the next most highly ranked genes in the comparison between psen2^T141_L142delinsMISLISV/+^ samples and psen2^N140fs/+^ samples, but which are not formally considered as DE*

Expression patterns for the next most highly ranked genes in the comparison between psen2T141_L142delinsMISLISV/+ samples and psen2N140fs/+ samples, but which are not formally considered as DE

Differential Transcript Expression

As the level of transcript complexity is less in zebrafish than human, and 1:1 mapping between species is less robust, only a brief analysis was performed at the transcript level. In essence, the same genes were found as the most highly ranked, with changes in expression of psen2 transcripts detected as expected, providing a form of positive control. Following the top tables, the basic transcript expression patterns are shown for three possible genes of interest. Notably, the transcripts showing the strongest differential expression are expressed at very low-levels for both si:ch211-132g1.3 and slc37a4b.

10 most highly ranked transcripts in the comparison between psen2N140fs/+ samples and psen2+/+ samples
Transcript Symbol logFC AveExpr P.Value FDR gene_id
ENSDART00000137332 si:ch211-132g1.3 -6.333 -1.451 2.801e-07 0.008417 ENSDARG00000089477
ENSDART00000187524 CABZ01035279.1 -8.439 -0.1248 5.656e-07 0.008498 ENSDARG00000116774
ENSDART00000127351 atxn1l -0.7309 2.88 5.628e-06 0.05638 ENSDARG00000086977
ENSDART00000114613 ptcd1 0.8702 2.581 8.968e-06 0.06737 ENSDARG00000076176
psen2N140fs psen2 3.401 -4.421 1.931e-05 0.116 ENSDARG00000015540
ENSDART00000185608 si:ch211-160d14.6 -6.728 -1.329 2.352e-05 0.1178 ENSDARG00000115710
ENSDART00000188158 BX649405.1 -1.041 2.11 6.452e-05 0.273 ENSDARG00000112605
ENSDART00000143184 mybpc2b 5.446 1.012 7.267e-05 0.273 ENSDARG00000021265
ENSDART00000006381 psen2 -0.9832 1.357 8.181e-05 0.2731 ENSDARG00000015540
ENSDART00000182716 actb1 -4.621 -2.578 9.602e-05 0.2885 ENSDARG00000113649
10 most highly ranked transcripts in the comparison between psen2T141_L142delinsMISLISV/+ samples and psen2+/+ samples
Transcript Symbol logFC AveExpr P.Value FDR gene_id
psen2T141_L142delinsMISLISV psen2 6.452 -2.98 1.806e-11 5.427e-07 ENSDARG00000015540
ENSDART00000168837 fam168b 1.906 2.399 3.9e-05 0.3795 ENSDARG00000101733
ENSDART00000006381 psen2 -0.9714 1.357 4.901e-05 0.3795 ENSDARG00000015540
ENSDART00000144157 si:ch211-56a11.2 1.025 1.713 5.052e-05 0.3795 ENSDARG00000093677
ENSDART00000186112 cntnap2a 0.3749 3.807 9.329e-05 0.4123 ENSDARG00000058969
ENSDART00000180982 hs6st1b 6.271 -2.079 0.0001158 0.4123 ENSDARG00000116688
ENSDART00000168762 si:ch73-236c18.2 1.184 0.9583 0.0001202 0.4123 ENSDARG00000103829
ENSDART00000172408 arhgap11a -1.641 -0.7479 0.0001313 0.4123 ENSDARG00000100019
ENSDART00000127351 atxn1l -0.5008 2.88 0.000136 0.4123 ENSDARG00000086977
ENSDART00000091529 wasf3b -0.4363 5.912 0.0001372 0.4123 ENSDARG00000062948
10 most highly ranked transcripts in the comparison between psen2T141_L142delinsMISLISV/+ samples and psen2N140fs/+ samples
Transcript Symbol logFC AveExpr P.Value FDR gene_id
psen2T141_L142delinsMISLISV psen2 6.428 -2.98 2.399e-11 7.208e-07 ENSDARG00000015540
ENSDART00000137332 si:ch211-132g1.3 5.577 -1.451 7.251e-07 0.01089 ENSDARG00000089477
ENSDART00000187524 CABZ01035279.1 7.413 -0.1248 1.465e-06 0.01467 ENSDARG00000116774
ENSDART00000150193 slc37a4b -1.158 -0.05288 8.973e-06 0.06741 ENSDARG00000077180
psen2N140fs psen2 -3.401 -4.421 1.187e-05 0.07134 ENSDARG00000015540
ENSDART00000168762 si:ch73-236c18.2 1.452 0.9583 1.894e-05 0.09485 ENSDARG00000103829
ENSDART00000147678 si:dkey-222h21.2 2.015 0.6667 3.399e-05 0.1445 ENSDARG00000094297
ENSDART00000141678 si:ch211-114l13.3 1.96 -1.02 3.847e-05 0.1445 ENSDARG00000094346
ENSDART00000188136 CABZ01084501.2 0.6338 4.433 7.284e-05 0.2116 ENSDARG00000113332
ENSDART00000185608 si:ch211-160d14.6 5.708 -1.329 7.721e-05 0.2116 ENSDARG00000115710